Benchmarking ligand-based virtual High-Throughput Screening with the PubChem database.

نویسندگان

  • Mariusz Butkiewicz
  • Edward W Lowe
  • Ralf Mueller
  • Jeffrey L Mendenhall
  • Pedro L Teixeira
  • C David Weaver
  • Jens Meiler
چکیده

With the rapidly increasing availability of High-Throughput Screening (HTS) data in the public domain, such as the PubChem database, methods for ligand-based computer-aided drug discovery (LB-CADD) have the potential to accelerate and reduce the cost of probe development and drug discovery efforts in academia. We assemble nine data sets from realistic HTS campaigns representing major families of drug target proteins for benchmarking LB-CADD methods. Each data set is public domain through PubChem and carefully collated through confirmation screens validating active compounds. These data sets provide the foundation for benchmarking a new cheminformatics framework BCL::ChemInfo, which is freely available for non-commercial use. Quantitative structure activity relationship (QSAR) models are built using Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), Decision Trees (DTs), and Kohonen networks (KNs). Problem-specific descriptor optimization protocols are assessed including Sequential Feature Forward Selection (SFFS) and various information content measures. Measures of predictive power and confidence are evaluated through cross-validation, and a consensus prediction scheme is tested that combines orthogonal machine learning algorithms into a single predictor. Enrichments ranging from 15 to 101 for a TPR cutoff of 25% are observed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ligand Based Pharmacophore Model Development for Anti Androgen Receptor Drugs for Treating Prostate Cancer

The computational modeling and high throughput screening techniques have been used to identify small molecules that specifically target functional surface sites of the androgen receptor in Prostate cancer. Pharmacophore modeling, Virtual screening, docking based analyses is used for development of new chemical entities. The purpose of the current work is to establish pharmacophore model for the...

متن کامل

Challenges in Secondary Analysis of High Throughput Screening Data

Repurposing an existing drug for an alternative use is not only a cost effective method of development, but also a faster process due to the drug's previous clinical testing and established pharmokinetic profiles. A potentially rich resource for computational drug repositioning approaches is publically available high throughput screening data, available in databases such as PubChem Bioassay and...

متن کامل

PubChem BioAssay: 2014 update

PubChem's BioAssay database (http://pubchem.ncbi.nlm.nih.gov) is a public repository for archiving biological tests of small molecules generated through high-throughput screening experiments, medicinal chemistry studies, chemical biology research and drug discovery programs. In addition, the BioAssay database contains data from high-throughput RNA interference screening aimed at identifying cri...

متن کامل

DRABAL: novel method to mine large high-throughput screening assays using Bayesian active learning

BACKGROUND Mining high-throughput screening (HTS) assays is key for enhancing decisions in the area of drug repositioning and drug discovery. However, many challenges are encountered in the process of developing suitable and accurate methods for extracting useful information from these assays. Virtual screening and a wide variety of databases, methods and solutions proposed to-date, did not com...

متن کامل

NCBI PubChem BioAssay Database

NCBI’s PubChem BioAssay database (1-5) (http://pubchem.ncbi.nlm.nih.gov) is a public repository for archiving biological tests of small molecules and siRNA reagents. Small molecule bioactivity data contained in the BioAssay database consist of information generated through high-throughput screening experiments, medicinal chemistry studies, chemical biology research, as well as literature curati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Molecules

دوره 18 1  شماره 

صفحات  -

تاریخ انتشار 2013